Yu-Gang Jiang

Fudan University

BraveGuard: From Open-World Threats to Safer Computer-Use Agents

May 31, 2026
Via arXiv

CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping

May 29, 2026
Via arXiv

VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models

May 28, 2026
Via arXiv

Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation

May 24, 2026
Via arXiv

Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance

May 22, 2026
Via arXiv

GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization

May 12, 2026
Via arXiv

World Action Models: The Next Frontier in Embodied AI

May 12, 2026
Via arXiv

Attention Itself Could Retrieve. RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval

May 10, 2026
Via arXiv

Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models

Apr 20, 2026
Via arXiv

ROSE: Retrieval-Oriented Segmentation Enhancement

Apr 15, 2026
Via arXiv